Synthesizing high-quality images from text descriptions is a challenging problem in computer vision and has many practical applications. Samples generated by existing text-to-image approaches can roughly reflect the meaning of the given descriptions, but they fail to contain necessary details and vivid object parts. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) to generate 256×256 photo-realistic images conditioned on text descriptions. We decompose the hard problem into more manageable sub-problems through a sketch-refinement process. The Stage-I GAN sketches the primitive shape and colors of the object based on the given text description, yielding Stage-I low-resolution images. The Stage-II GAN takes Stage-I results and text descriptions as inputs, and generates high-resolution images with photo-realistic details. It is able to rectify defects in Stage-I results and add compelling details through the refinement process. To improve the diversity of the synthesized images and stabilize the training of the conditional GAN, we introduce a novel Conditioning Augmentation technique that encourages smoothness in the latent conditioning manifold. Extensive experiments and comparisons with state-of-the-art methods on benchmark datasets demonstrate that the proposed method achieves significant improvements in generating photo-realistic images conditioned on text descriptions.
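The Conditioning Augmentation idea can be sketched as sampling the conditioning vector from a Gaussian whose mean and variance are predicted from the text embedding, with a KL penalty toward the standard normal to keep the conditioning manifold smooth. The following is a minimal illustrative sketch, not the paper's implementation: the embedding dimensions (1024 → 128), the random linear layers, and the numpy setup are all assumptions for demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: a 1024-d text embedding compressed to a 128-d
# conditioning vector (the abstract does not specify dimensions).
EMB_DIM, COND_DIM = 1024, 128

# Stand-ins for the learned fully connected layers that predict the mean
# and log-variance of the Gaussian conditioning distribution.
W_mu = rng.normal(0.0, 0.01, (COND_DIM, EMB_DIM))
W_logvar = rng.normal(0.0, 0.01, (COND_DIM, EMB_DIM))

def conditioning_augmentation(phi_t):
    """Sample c ~ N(mu(phi_t), diag(sigma(phi_t)^2)) with the
    reparameterization trick, and return the KL(N(mu, sigma^2) || N(0, I))
    term that regularizes the conditioning manifold toward smoothness."""
    mu = W_mu @ phi_t
    logvar = W_logvar @ phi_t
    sigma = np.exp(0.5 * logvar)
    eps = rng.standard_normal(COND_DIM)
    c = mu + sigma * eps  # differentiable sample of the conditioning vector
    # Closed-form KL divergence between N(mu, sigma^2) and N(0, I).
    kl = 0.5 * np.sum(mu**2 + np.exp(logvar) - logvar - 1.0)
    return c, kl

phi_t = rng.standard_normal(EMB_DIM)  # mock text embedding
c, kl = conditioning_augmentation(phi_t)
print(c.shape, kl >= 0.0)
```

Because `c` is sampled rather than fixed, the same text description yields varied conditioning vectors, which is one way such a scheme can improve sample diversity while the KL term stabilizes training.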